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ABSTRACT 

A new serendipitous XMM survey in the area of the Sloan Digital Sky Survey is described 
(XMM/SDSS), which includes features such as the merging of overlapping fields to increase 
the sensitivity to faint sources, the use of a new parametrisation of the XMM point spread 
function for the source detection and photometry, the accurate estimation of the survey sensi- 
tivity. About 40000 X-ray point sources are detected over a total area of 122 deg^. A subsam- 
ple of 209 sources detected in the 2-8 keV spectral band with SDSS spectroscopic redshifts in 
therange0.03 < z < 0.2, optical magnitudes r < 17.77mag and log Lx(2 — 10 keV) > 41.5 
(erg/s) are selected to explore their distribution on the colour magnitude diagram. This is com- 
pared with the colour-magnitude diagram of X-ray AGN in the AEGIS field at z « 0.8. We 
find no evidence for evolution of the rest-frame colours of X-ray AGN hosts from z = 0.8 
to z = 0.1. This suggests that the dominant accretion mode of the AGN population, which is 
expected to imprint on the properties of their host galaxies, does not change since z = 0.8. 
This argues against scenarios which attribute the rapid decline of the accretion power of the 
Universe with time (1 dex since z = 0.8) to changes in the AGN fueling/triggering mode. 

Key words: astronomical data bases: surveys - methods: data analysis - galaxies: active - 
galaxies: Seyferts - X-rays: diffuse background 



1 INTRODUCTION 

Understanding the evolution of Active Galactic Nuclei (AGN), 
which signpost accretion events onto Supermassive Black Holes 
(SMBHs), remains a challenge for modern astrophysics. Although 
observations have demonstrated beyond any doubt that the lumi- 
nosity density of these systems has dropped by more than 1 or- 
der of magnitude from z ~ 1 to the present day (e.g. Ueda et aT] 



[2003i|Hasmger et al.|2005||Ebrero et al.|2009[|Aird et a .|2010^ , the 

physical mechanisms that drive this rapid decline are still not well 
constrained. The lack of a physical description for the cosmologi- 
cal evolution of SMBHs has implications that go beyond the AGN 
community. Recent evidence indicates an intimate relation between 
the building of galaxies and the growth of the SMBH at their cen- 
tres (e.g. |Ferrarese & Merritt 2000; Gebh ardt et al.|2000| . There- 
fore without a better understanding of AGN evolution our picture 
for the buildup of galaxies will also be incomplete. 

Galaxy mergers have long been proposed as the mechanism 
that triggers AGN and drives their cosmological evolution. Numer- 
ical SPH (Smoothed Particle Hydrodynamic) simulations demon- 
strate that these violent events are very efficient in furmeling gas 
to the nuclear galaxy regions (e.g. |Hernquist| [19891 [Barnes &| 
[Hemqu ist|1991||1996|, where it can be co nsumed by the SMBH 
^Springel et al.|2005||Di Matteo et al.|2005| l. Consequently in most 



semi-analytic cosmological simulations of galaxy formation (e.g. 
[Cattaneo et al.|2005||Somerville et al.|2008i|Wang et al.|2008) the 
merging of near equal mass gas rich galaxies is the primary mecha- 
nism for growing SMBHs. In this family of models the evolution of 
AGN is intimately related to the decline of the fraction of gaseous 
major mergers with redshift (e.g. |L6pez-Sanjuan et al.|2009| l. Alter- 
native scenarios propose that the dominant mode of accretion onto 
SMBHs changes with time l |Hasingerj20 08 , Fanidakis et al.|2010|l, 
thereby leading to the observed decline of the AGN space density 
since z ~ 1 — 2. It is suggested for example, that SMBHs form 
primarily in violent gaseous major merger events at high redshift 
^ 1; "QSO-mode"'Hop ldns et al.|2006| l, while stochastic accre- 
tion (e.g. internal instabilities, minor interactions) dominates the 
growth of SMBHs ("Seyfert-mode" [Hopkins & Hemquist||2006l > 
at lower redshift (z < 0.5) and produces, on average, lower lu- 
minosity systems. Another possibility proposed by [Fanidakis et al.] 
(20i0i is that the dominant AGN fueling mode shifts from disk in- 
stabilities at high redshift to halo gas accretion (also termed "radio" 
mode, [Croton et al.|2006[ > at low redshift. One of the testable pre- 
dictions of those models is that the properties of AGN hosts, such 
as the the star-formation history and the morphology, change with 
redshift, from z > 1 to z < 0.5. In contrast, in the merger only 
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driven evolution scenario, AGN should be hosted by galaxies that 
have similar properties at all redshifts. 

The study of AGN hosts as a function of redshift requires a ho- 
mogeneously selected AGN sample over a wide redshift baseline, 
2 ~ to z ~ 1 and beyond. Only then can one compare directly 
the properties of AGN hosts at different epochs. Variations in the 
selection function with redshift are hard to account for and may 
lead to erroneous conclusions. At z > 1 in particular. X-ray obser- 
vations, especially at energies > 2keV, are one of the most effi- 
cient and least biased methods for locating AGN with a selection 
function that is easy to quantify. The infrared is a promising wave- 
length regime for finding active SMBHs, although there are issues 
related to contamination of infrared selected AGN samples by star- 
[Georgantopoulos et al.' 2008[|Donley et al.|2008[[Pope| 
I Georgakakis et al. 20101. Optical spectroscopy is also a 



bursts (e.g. 



|etal.|2008| 



powerful tool for identifying AGN, but aperture effects, which are 
particularly severe at high redshifts, raise concerns about dilution 
of the AGN emission lines by the host galaxy stellar population 
(e.g. [Severgnini et al.|2003[ l. As a result, surveys with XMM and 
Chandra, both deep/pencil-beam and shallow/wide, have been the 
workhorse of the astronomy community for compiling AGN sam- 
ples. Those surveys however, essentially probe active SMBHs close 
to the peak of the accretion power of the Universe, z ^ 1, and lack 
the area to provide meaningful constraints on the AGN population 
at z < 0.5. The Sloan Digital Sky Survey (SDSS;|Abazajian et al 



[2009 ) has identified the largest sample of low redshift (z « 0.1) 
AGN todate, using diagnostic emission line ratios I jKauffmann et al.| 
|2004[ l. The selection function of that sample however, is very differ- 
ent from that of X-ray AGN in deep surveys, rendering the compar- 
ison difficult. For example, the SDSS AGN include a large number 
of LINERs (Low lonisation Nuclear Emission line Region), which 
are controversial objects and may not be powered by accretion onto 
a SMBH (e.g. |Sarzi et al.|2010^ . More relevant to high redshift X- 
ray AGN surveys are the serendipitous near all-sky AGN samples 
compiled in the nearby Universe by the high energy missions IN- 
TEGRAL (20-100 keV, total of 144 Seyfert AGN, Beckmann et al. 
and SWIFT (15-195 keV, total of 266 Seyferts, |Tueller et al.. 
Although those AGN samples are selected at much higher 
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rest-frame energies compared to X-ray sources detected by Chan- 
dra or XMM at z ~ 1 (< 15keV), they are more appropriate than 
SDSS AGN as a low redshift comparison sample. Ideally however, 
one would like to select nearby AGN at rest-frame X-ray energies 
comparable to those of Chandra/XMM surveys at jz ~ 1. 

In this paper we describe the compilation of a large sample 
of X-ray selected AGN at 2 ~ 0.1 using a new serendipitous X- 
ray survey (hereafter referred to as XMM/SDSS) in the area of the 
SDSS based on archival XMM observations. The advantage of the 
low redshift X-ray AGN subset of the XMM/SDSS survey is that 
the selection function is almost identical to deep pencil-beam sam- 
ples, thereby minimising differential selection biases. As a result 
the comparison between the AGN host galaxy properties between 
2 « 0.1 and 2 ~ 1 is greatly facilitated. The XMM archive in- 
cludes over 10 years worth of observations, which allow serendipi- 
tous surveys over many tens of square degrees on the sky. Therefore 
the volume probed by the XMM/SDSS sample at 2 « 0.1 is orders 
of magnitude larger than any wide-area survey carried out by either 
XMM or Chandra, thereby allowing detailed studies of the statis- 
tical properties of low redshift X-ray AGN. The SDSS is the field 
of choice for our serendipitous X-ray survey as it provides the es- 
sential follow-up observations for the identification of low redshift 
X-ray AGN. The SDSS optical data include 5-band photometry 
(ugriz) and extensive optical spectroscopy for galaxies at 2 ~ 0.1. 



Wide area surveys at various wavelengths in the SDSS are either 
underway or completed, thereby allowing panchromatic studies of 
the XMM/SDSS X-ray AGN over a wide redshift baseline. The 
ancillary data in the SDSS include (i) the UKIRT Infrared Deep 
Sky Survey (UKIDSS, [Lawrence et~ari[2007l l, which will cover 
4000 deg^ of the SDSS in 4 near-infrared bands to K = 18.5 mag, 
(ii) the AKARI all-sky survey in 6 mid- and far-infrared bands from 
9 - 200^m (Ishihara et al. 20101, (iii) the Herschel ATLAS far- 
infrared survey (Eales et al.,2010 ) and (iv) the FIRST survey, which 
provides deep (ImJy) radio continuum (1.4GHz) images of the en- 
tire SDSS i jBecker et al.[T995l l. 

The structure of the paper is as follows. In sections 2 to 9 we 
describe the automated pipeline developed to reduce the archival 
XMM observations that overlap with the SDSS. The data reduc- 
tion steps include the construction of event files and images in dif- 
ferent energy bands, the detection of sources, the calculation and 
application of astrometric corrections, the estimation of fluxes and 
the identification of X-ray sources with optical counterparts. The 
pipeline uses a new parametrisation of the XMM Point Spread 
Function (Appendix A), which is employed for source detection 
and photometry, and an accurate method for estimating the sen- 
sitivity of the X-ray survey to point sources. Section 10 demon- 
strates the utility of our serendipitous X-ray AGN sample by com- 
paring, for the first time, the colour magnitude diagrams of X- 
ray selected AGN at 2 ~ 0.1 and 2 « 0.8. Section 11 dis- 
cuss the results and conclusions. Throughout this paper we adopt 
Ho = 100kms~^Mpc~\ Om = 0.3 and SIa = 0.7. Rest frame 
quantities (e.g. absolute magnitudes, luminosities) are parametrised 
hyh = Ho/WO. 



2 ORGANISATION OF THE XMM DATA 

For the construction of the serendipitous X-ray source catalogue, 
XMM observations that were taken in full frame and extended full 
frame modes and were public by July 2009 (total of 3818 obser- 
vations) are selected. We then chose only those that overlap with 
the SDSS DR7 (Data R elease 7; |Abazajian et"ar][2009) . We use 
the spherical polygons ( [Hamilton & Tegmark| 12004) defined in 
the New York University Value Added Galaxy Catalogue (NYU- 
VAGC; [Blanton et al.|2005l l to describe the geometry of the SDSS 
survey. This geometry is the area covered by the SDSS-DR7 imag- 
ing survey after masking out Tycho stars as described by |Blanton| 
|et aH (2005 ). If the aimpoint of an XMM observation falls within 
one of the masked areas it is excluded from further analysis. This 
requirement reduced the number of analysed XMM observations to 
928. 

The basic unit of the survey data structure is the individual 
XMM observation, which will also be referred to as "obsid". XMM 
observations are assigned to "projects" and "pointings", which are 
defined below. Overlapping obsids are merged by the pipeline to 
maximise the sensitivity of the survey to faint sources. For that pur- 
pose the data are organised into projects which include at least one 
XMM observation. If an XMM field lies within 28 arcmin of the 
aimpoint of at least one obsid of a project, it is assigned to that 
project. Otherwise a new project is defined. The radial distance cut 
is chosen to ensure overlap between adjacent XMM fields, which 
have usable sizes of approximately 14 arcmin radius. Observations 
that belong to separate projects are therefore non overlapping. A 
project is assigned a unique name using up to the first 9 characters 
of the target name keyword of the first XMM observation assigned 
to that project. Projects with more than one XMM observations are 
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Figure 1. Demonstration of how the XMM observations are organised into 
projects and pointings. The individual XMM observations shown in the fig- 
ure belong to the same project as they are contiguous. The identification 
name of the project is SDSS1341 after the target name of the first osbid that 
was assigned to the project. The squares are 40 arcmin on the side and repre- 
sent pointings. All XMM observations that overlap with a 40 X 40 arcmin-^ 
box are merged to increase the sensitivity of the serendipitous survey to faint 
sources. The positions of the pointings are defined in a way which guaran- 
tees that the entire contiguous area of the project is covered by at least one 
pointing independent of the layout of the individual obsids 

organised in pointings, which are defined as 40 x 40 arcmin'^ boxes 
and include all the obsids of the project that overlap with that box. 
The first observation of a project defines the centre of the first point- 
ing. Project observations that lie more than 6 arcmin away from an 
existing pointing define the centres of new pointings. This choice of 
radial distance leads to substantial overlap between adjacent point- 
ings but ensures that the contiguous area of a project is covered by 
at least one pointing independent of the layout of individual ob- 
sids. This is demonstrated in Figure [T| which plots the positions of 
the pointings of a particular project in the XMM serendipitous sur- 
vey. Once the pointings of a project are defined, they are assigned 
all the project observations that lie within 28 arcmin off the cen- 
tre of each pointing. Therefore all the XMM observations that have 
at least a small fraction of their field of view overlapping with the 
40 X 40 arcmin^ pointing box belong to the same group and are 
eventually merged by the pipeline. A particular obsid of a project 
may belong to more than one pointings. 



3 DATA PROCESSING 

The XMM observations are reduced using the Science Analysis 
System (SAS) version 9.0. The first step is to produce event files 
from the Observation Data Files (ODF) using the EPCHAIN and 
EMCHAIN SAS tasks for the EPIC PN and MOS detectors respec- 
tively. This includes (i) the creation of raw event lists and Good 
Time Interval (GTI) data for each CCD of the EPIC detectors, (ii) 
the identification of bad pixels, (iii) the calibration of the raw event 
lists by flagging trailing events, performing pattern recognition, 
gain and CTI (Charge Transfer Inefficiency) corrections, (iv) the 



estimation of linearised detector and sky coordinates and (v) the 
merging of the calibrated event lists of different CCDs into event 
files for each EPIC detector. 



Pixels along the edges of the CCDs of the PN and MOS de- 
tectors are removed, as in our experience their inclusion often re- 
sults to spurious detections. The excluded pixels for the EPIC- 
PN correspond to rows with RAWX =1, 64 and columns with 
RAWY = 199 - 200 and RAWY < 24. The latter set of pixels 
are flagged out to reduce bright low energy edges ('Wats on et al.| 
(2009). For the EPIC MOSl, M0S2 we remove pixels with coordi- 
nates (RAWX, RAWY) = (1-4, 1-600), (RAWX, RAWY) = 
(597 - 600, 1 - 600), (RAWX, RAWY) = (1 - 600, 1 - 4), 
(RAWX, RAWY) = (1 - 600, 597 - 600). Events with patterns 
> 4 for the PN and > 12 for the MOS detectors are removed and 
the filter formulae (FLAG & Ox766b0808)==0 for the MOS and 
(FLAG & Ox2fb0808)==0 for the PN are applied. Those flags ex- 
clude events from offset columns and spoiled frames and remove 
electronic noise ( [Marty et al.||2003] ). Further filtering is applied 
to observations targeting bright X-ray sources, such as QSOs or 
Galactic stars (total of 47). Those observations are identified by 
visual inspection and are found to suffer from diffraction spikes 
as well as out-of-time events stripes. The targets are placed at the 
aimpoint of the XMM EPIC cameras and therefore the affected re- 
gions are fixed in detector coordinates. For the PN we remove the 
out-of-time events stripe in rows RAWX = 29 — 44 of CCD4 and 
the upper part of CCDs 4 (RAWY > 155) and 7 (RAWY > 170), 
which are affected by the diffraction spikes of the target. In the case 
of MOSl the removed pixels lie on CCD 1 (RAWX = 292 - 328 
and RAWY = 1 - 600,RAWX = 140 - 480 and RAWY = 
164 — 454). For MOS2 we remove pixels from CCD 1 with detec- 
tor coordinates RAWX = 277 - 320 and RAWY = 1 - 600, 
RAWX = 163 - 447 and RAWY = 144 - 444. 



Flaring background periods are identified using a methodol- 
ogy similar to that described in |Nandra et al.|p007) . First, sources 
are removed from the event file to avoid AGN variability biasing 
the results. The EWAVELET task of SAS with a threshold of 5 times 
the local background rms is used to detect sources in the 0.5-8 keV 
spectral band of the combined image of all EPIC detectors available 
to a particular XMM observation. These sources are then masked 
out from the event file before generating the background light curve 
by binning events in the energy range 0.2-12 keV in 20 s intervals. 
The approximate quiescent background level is determined by cal- 
culating the count rate limit at which the excess variance of the 
background light curve is minimum (c.f. Nandra et al. 2007, who 
estimates the limit where the excess variance is zero). Periods with 
background count rate 2 times higher than the level where the ex- 
cess variance is minimum are excluded from the filtered event file 
(see Figure |2j. It is verified that this methodology works well for 
XMM. It fails to identify however, cases where the entire observa- 
tion is affected by flares and hence, has an overall elevated back- 
ground. Therefore PN and MOS observations with quiescent back- 
ground levels higher than 50 and 20cts~^ respectively, are not 
analysed. It is empirically found that those values exclude prob- 
lematic observations from further analysis. Also, if the Good Time 
Interval for a particular detector, after the removal of flaring peri- 
ods, is less than 1 ks, it is excluded from further analysis. A total of 
95 obsids have been removed entirely either because of high flares 
or short clean exposure times. 
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Figure 2. Light curve of XMM observation 0041170101 whicli sliows a 
flare at the end of the observation. The dashed horizontal line marks the 
approximate quiescent background level determined by the methodology 
described in the text. The dotted line corresponds to the count rate limit 
above which events are excluded. 



4 IMAGE CONSTRUCTION 

Images in celestial coordinates with pixel size of 4.35 arcsec are 
constructed in 5 energy bands, 0.5-8 IceV (full), 0.5-2 keV (soft), 
2-8keV (hard), 5-8keV (very hard; vhrd) and 7.5-12keV (ultra 
hard; uhrd). The upper energy limit of 8 keV for the full, hard and 
vhrd band images is chosen because of the reduced effective area 
of XMM at higher energies. Also, below 0.5 keV the XMM back- 
ground is elevated due to the Galactic X-ray emission component 
( |Lumb et al.||2002") . Therefore, photons with energies < 0.5 and 
> 8 keV primarily increase the background and do not improve the 
signal-to-noise ratio of the final image, thereby limiting the sensi- 
tivity of the survey. Exposure maps, which account for vignetting, 
CCD gaps and bad pixels, are constructed at monochromatic ener- 
gies of 2 keV (full band), 1 keV (soft), 3.5 keV (hard), 6 keV (vhrd) 
and 8.5 keV (uhrd). Those values correspond to the mean photon 
energy in each spectral band of a power-law X-ray spectrum with 
r = 1.4 (i.e. similar to the diffuse X-ray background; e.g. |Hickoxl 
|& Markevi"tchl|2006| l folded through the sensitivity of the XMM 
EPIC cameras (mirror effective area, energy redistribution matrix). 

For projects with at least one pointing, the EPIC images and 
exposure maps of all the observations of a pointing are merged prior 
to source detection to maximise the sensitivity to faint sources. In 
the case of projects that consist of a single XMM observation the 
EPIC images and exposure maps are coadded. 



5 POINT SOURCE DETECTION 

X-ray point sources are searched for in the merged EPIC images of 
either a pointing or a project with a single XMM observation. The 
detection of sources is performed independently in each of the 5 
spectral bands defined in section |4] soft, full, hard, vhrd and uhrd. 
A methodology similar to that described in |Nandra et al.||2005^ 
and |Laird et al.| ( |2009[ l is adopted. Source candidates are identified 
using the wavelet based EWAVELET source detection task of SAS at 



a low threshold of 4(j above the background, where a is the rms of 
the background counts. 

For each candidate source the Poisson probability of a ran- 
dom background fluctuation is estimated. This step involved the 
extraction of the total counts at the position of the source and the 
determination of the local background value. For the extraction of 
counts a new set of Point Spread Function (PSF) estimates is em- 
ployed, which accounts for both the varying elliptical shape and 
size of the XMM PSF as a function of position on the EPIC detec- 
tors. Details of the calculation of the PSF are presented in the ap- 
pendix |A| At each source position the counts are extracted within 
an elliptical aperture that corresponds to 70 per cent of the PSF 
Encircled Energy Fraction (FEE). The total counts at a candidate 
source position, T, is the sum of the extracted counts from indi- 
vidual EPIC cameras. For each source the local background is es- 
timated by first masking out all detections within 4 arcmin of the 
source position using an elliptical aperture that corresponds to the 
80 per cent FEE ellipse. The counts from individual EPIC cam- 
eras are then extracted using elliptical annuli centred on the source 
with inner and outer semi-major axes of 5 and 15 pixels (0.36 and 
1 .09 arcmin) respectively. The position angle and ellipticity of the 
ellipses are set equal to those of the 70 per cent FEE. The mean 
local background, B, is then estimated by summing up the back- 
ground counts from individual EPIC cameras after scaling them 
down to the area of the source count extraction region. The Poisson 
probability P{T, B) that the extracted counts at the source posi- 
tion, r, are a random fluctuation of the background is calculated. 
We consider as sources the detections with P{T, B) < 4 x 10~^. 
At this cutoff less than about 0.01 false sources per XMM pointing 
are expected given the typical size of the 70 per cent FEE aper- 
ture (15-20 arcsec radius depending on energy and off-axis angle) 
and the field of view of the XMM (about 14 arcmin radius). The 
candidate source list generated by EWAVELET task includes a large 
number of background fluctuations which are masked out during 
the estimation of the local background for individual sources. This 
may lead to an underestimation of the local background, as posi- 
tive background fluctuations may be excluded as source candidates. 
We account for this potential source of bias by repeating the pro- 
cess above using as input candidate source list the one with Poisson 
probability P(T, B) < 4 x lO"'^. 

Our methodology is optimised for the detection of point 
sources. The final catalogue however, includes extended X-ray 
sources associated with hot gas from galaxy clusters or groups. 
Also, the extended X-ray emission regions of bright clusters are 
often split into multiple spurious detections by our source detec- 
tion pipeline. The SAS task EMLDETECT, although also optimised 
for the detection of point sources, can also identify extended X-ray 
emission (see Watson et al. 2009 for a discussion of limitations) 
as well as spurious detections. This task is used to perform simul- 
taneous PSF fits to the source count distribution of all the EPIC 
images of an XMM observation to estimate the detection likeli- 
hood, the source extent and the extent likelihood. EMLDETECT is 
applied separately to each of the five energy bands, soft, full, hard, 
vhrd and uhrd. The free parameters are the source count rate and 
source extent. The source positions are kept fixed during the min- 
imisation process. The background maps described in section 7 are 
used as input to the EMLDETECT task. The energy and position de- 
pendent PSF provided in the calibration database is used in the fit. 
We caution that the PSFs employed by EMLDETECT are different 
from those described in Appendix [A] The task is allowed to fit up 
to 3 sources simultaneously if they fall within the 90 per cent FEE 
radius of the calibration database PSF. For the determination of the 
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extent of a source the PSF is convolved with a /3-model profile. If 
the hkelihood of the extended model is below 3 or the extent of 
the source is less than 1.5 pixels then point source parameters are 
determined by the task. In the case of pointings with multiple obser- 
vations the EMLDETECT task is ran on each observation separately. 
The individual likelihoods for a given source are then summed up 
and transformed to equivalent likelihoods L2 corresponding to the 
case of two free parameters, as described in the EMLDETECT docu- 
mentation. Based on visual inspection of the XMM images we con- 
sider as extended those sources with extension likelihood > 3 in 
either the full or the soft spectral bands. For point sources we wish 
to keep the sample selection as clean and transparent as possible. 
One of the advantages of our point-source detection methodology 
is that the selection function, which is based on the Poisson spuri- 
ous probability threshold, can be quantified to a high degree of ac- 
curacy (see sensitivity map section below). Introducing additional 
selection criteria, e.g. keeping sources above a given EMLDETECT 
detection likelihood threshold, would modify the selection function 
of the sample in a way that is hard to quantify. We therefore choose 
to keep those biases to the minimum by excluding only those point 
sources for which EMLDETECT failed to determine a reliable fit and 
did not estimate a detection likelihood. In the final XMM/SDSS 
catalogue those sources are listed with an EMLDETECT detection 
likelihood < and are considered spurious. Such sources are typi- 
cally associated with image artifacts, such as diffraction spikes that 
have not been properly removed, extended hot gas emission split 
by the detection algorithm into multiple sources etc. 

The source catalogues in different energy bands are merged to 
produce a unique list of sources for either a pointing or a project 
consisting of a single XMM observation. A source detected in a 
particular energy band is cross-matched with sources in other en- 
ergy bands by finding the closest counterpart. The search radius 
is set to the 70 per cent EEF circular radius, i.e. SMA x — e, 
where SMA is the semi-major axis and e is the ellipticity of the 
ellipse. If a source lies on multiple obsids or detectors the search 
radius is estimated as the exposure time weighted average of indi- 
vidual 70 per cent EEF circular radii. It is verified that this choice 
of search radius is the optimal for identifying the same source in 
different energy bands. In practice we start by band-merging the 
full band sources and then proceed with the soft, hard, vhrd and 
uhrd catalogues. 

As there is overlap between adjacent pointings of a project, 
it is essential to identify and remove duplicates to produce unique 
source lists for projects with multiple pointings. For each source of 
a pointing we search for the closest counterpart in adjacent point- 
ings within a fixed search radius of 12arcsec. From the list of du- 
plicates we keep in the final list the one with the smaller angular 
distance from the aimpoint of the pointing on which it is detected. 

In the final band-merged source catalogue of unique sources 
there are 1258 extended and 827 spurious sources and a total of 
39830 point sources. Table [T| presents the total number of point 
sources detected in each spectral band. It should be stressed that our 
detection method is geared toward point sources and therefore the 
extended source catalogue should be treated with caution. Provid- 
ing a well defined sample of X-ray selected clusters is not among 
our goals. The final point source catalogue also includes targets. 



6 ASTROMETRY 

The pointing errors of the XMM observations are of the order 
of few arcsec. The positions of individual X-ray sources how- 
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Figure 3. Distribution of the RA (bottom panel) and DEC (top panel) cor- 
rections applied to the celestial coordinates of the X-ray sources to account 
for systematic errors in the determination of the absolute XMM pointing. 
The (5RA and (5DEC offsets are estimated by matching the positions of 
X-ray sources with optical counterparts in the SDSS. 



ever, can be determined to an accuracy of 1-2 arcsec (Waston et 
al. 2009). The astrometry of the X-ray catalogue can therefore be 
improved by correcting the RA and Dec of X-ray sources for sys- 
tematic errors. For that purpose the positions of optically selected 
sources in the SDSS, which have sub-arcsec astrometric uncertain- 
Pier et alj2003 



are cross-correlated with the 



ties ( ^ 0.1 arcsec. 

X-ray source list. I'he EPOSCORR task of SAS is used to determine 
the offsets in RA, Dec and roll angle of an XMM pointing that 
maximises the number of optical identifications. Only X-ray point 
sources are used in this exercise, i.e. detections flagged as extended 
or spurious by the EMLDETECT task of SAS are excluded from the 
cross-matching. Also, only SDSS sources brighter than r = 22 mag 
are matched to the X-ray source positions. In the case of observa- 
tions with less than 8 optical identifications, no corrections are ap- 
plied to the X-ray source positions. Figure|3]plots the distribution of 
the RA, Dec offsets applied to the celestial coordinates of the X-ray 
sources and shows that typical corrections are about 1-2 arcsec. In 
the case of projects with multiple pointings the astrometry of indi- 
vidual obsids should be corrected for systematics prior to merging. 
We have verified however, that this approach produces results that 
are similar to the simpler and less time-consuming method of esti- 
mating astrometric corrections for the final source list of individual 
pointings of a project, i.e. after merging. 



7 FLUX ESTIMATION AND HARDNESS RATIOS 

For sources with Poisson detection probability < 4 x 10~® in at 
least one of the five spectral bands fluxes are estimated in all bands 
by summing up the counts within the 80 per cent EEF of the ellipti- 
cal PSFs described in Appendix|A] Compared to source detection, a 
larger aperture is adopted for the photometry. The choice of radius 
is to include as large a fraction of source photons as possible, while 
at the same time keeping the background noise low. Also, for larger 
EEFs, the radius becomes large enough that the contribution of pho- 
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tons from nearby sources becomes an issue. The local background 
at the position of a source is estimated as described in the previous 
section. The counts from all EPIC cameras of all the XMM obser- 
vations that a source lies on are summed up for the estimation of 
the flux in a particular energy band. The different response of the 
EPIC cameras is accounted for by adopting the appropriate count 
to flux conversion factors, ECF. Suppose a source on the detector 
i of an XMM observation. The total number of counts, d, at the 
source position within the 80 per cent EEF are 



a^fxx ECF, X EEF xU + B„ 



(1) 



where fx is the source flux, ECFi is the count to flux conversion 
factor for the camera i, EEF = 0.8 is the encircled energy frac- 
tion, ti is the exposure time at the source position and Bi is the 
local background value. Summing up over i, i.e. over all observa- 
tions and all EPIC cameras on which the source lies on gives 



Y^C,^ fxx EEF X (eCF, xtA+J2 B- 



(2) 



The simplest approach for estimating the source flux is to solve the 
equation above for fx 



(3) 



The equation above however, ignores the Eddington bias and the 
fact that there is a continuous probability distribution of fluxes 
which can produce the observed counts. We account for these ef- 
fects by adopting the Bayesian methodology described in |Laird| 
|et al . ( 2009 1 to calculate the source fluxes and la confidence limits 
from the detected counts. In brief, the probability of the source hav- 
ing flux fx given the observed number of total counts C (source 
and background) follows Poisson statistics 



Pifx,C) 



C\ 



(4) 



where T is the mean expected total counts in the detection cell. This 
quantity is determined as a function of flux from equation [T] The 
last term in equation]?] 7r(/x), reflects our prior knowledge of the 
distribution of source fluxes, i.e. the differential X-ray source num- 
ber counts, which are described by a double power-law (e.g. |Geor^ 
Igakakis et al.||2008| . This term accounts for the Eddington bias, 
in which statistical variations of the observed source counts com- 
bined with the steep log — log S of the X-ray population result in 
brighter measured fluxes for the detected sources compared to their 
intrinsic ones. The differential X-ray source counts are approxi- 
mated by a double power-law with faint and bright end slopes of - 
1.5 and -2.5 respectively and a break flux of 10"^'* ergs"^ cm~^. 
The adopted values are representative of the double power-law pa- 
rameters estimated for X-ray samples selected in different energy 
bands (e.g. [Georgakakis et al.|2008] l. Equation]4]is used to estimate 
the mode of the X-ray flux distribution, which can be calculated 
analytically. The 68 per cent confidence limits on the flux are esti- 
mated by integrating equation]4]as described in jLaird et al.| ( |200^ . 

The energy to flux conversion factor is estimated for each 
EPIC camera separately assuming a power-law X-ray spectrum 
with r = 1.4, i.e. similar to the XRB, absorbed by the appropri- 
ate Galactic hydrogen column density. The latter is derived from 
the HI map of jKalberla et al.| ( |2005^ using the right ascension and 
declination of the aimpoint of each XMM observation and the NH 
task of FTOOLS. The energy to flux conversion factors are such that 
the counts from the soft, full, hard, vhrd and uhrd bands are trans- 



formed to fluxes in the 0.5-2, 0.5-10, 2-10, 5-10 and 7.5-12keV 
bands respectively. 

Hardness ratios (HRs) are estimated between the soft (0.5- 
2keV) and the hard (2-8 keV) band count rates, S and H respec- 
tively, following the standard definition 

H- S 



HR: 



H + S' 



(5) 



The BEHR (Bayesian Estimation of Hardness Ratios, [Park et al.] 
]M06 ) code is used, which is designed to determine HRs in the Pois- 
son regime of low counts and to compute the proper uncertainty re- 
gardless of whether the source is detected in both passbands or not. 
Hardness ratios were estimated independently for the PN, MOSl 
and M0S2 detectors. In the case of multiple overlapping observa- 
tions the summed source and background counts as well as the to- 
tal exposure time from each detector are used as input to the BEHR 
code. The estimated HRs are scaled to the on-axis exposure time by 
setting the BEHR parameters SOFTEFF and HARDEFF to the ratios 
of the on-axis exposure time over the exposure time at the source 
position in the soft and hard bands respectively. In the case of over- 
lapping observations these parameters are defined by the relation 



(6) 



where ef f represents either the SOFTEFF or the HARDEFF BEHR 
parameter, the summation is over all XMM observations, i, d rep- 
resents the counts (soft or hard band) at the source position (source 
and background), ton.i is the on-axis exposure time and ti is the 
exposure time at the source position. 



8 SENSITIVITY MAPS 

The construction of the sensitivity maps follows the methodology 
of [Georgakakis et aL] ( j2008 l, which accurately estimates the prob- 
ability of detecting a source with a given X-ray flux accounting 
for observational effects, such as vignetting, flux estimation biases 
(e.g. Eddington bias) and the fraction of spurious sources expected 
in any source catalogue. The important parameters in this exercise 
are the size and shape of the detection cell, which are well de- 
fined in our source detection method, and the Poisson probability 
threshold, Pthresh, below which an excess of counts is considered 
a source. By fixing Pthresh one sets the minimum number of pho- 
tons in a cell, L, for a formal detection. 

The first step is the estimation of the source-free background 
across the EPIC detectors. The background map is estimated using 
custom routines to first remove the counts in the vicinity of detected 
sources using the 80 per cent EEF elliptical apertures. Pixel values 
in the source regions are replaced by sampling from the distribution 
of pixel values in the local background regions. These are defined 
by elliptical annuli centred on each source with inner semi-major 
axis 5 pixels (~ 21.8arcsec) larger than the 80 per cent EEF size 
and widths of 15 pixels (~ 62.3arcsec). The resulting maps are 
smoothed by replacing each pixel value with the median within a 
sliding box of 20 pixels (87 arcsec) in size. The smoothed back- 
ground maps are then used to estimate the mean expected back- 
ground counts, within the detection cell, B (i.e. the 70 per cent 
EEF elliptical apertures). In the case of merged images from differ- 
ent detectors and/or XMM observations, B is the sum of the mean 
expected background counts within the detection cell of individual 
images. The cumulative probability that the observed counts in a 
particular detection cell will exceed L is 
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Pb{> L)^-fiL,B), 



(7) 



where the function j{a, x) is the incomplete gamma function de- 
fined as 



1 



(8) 



Adopting a detection threshold Pthresh one can invert equation |7] 
numerically to estimate the (integer) detection limit L for a cell 
with mean expected background B. Repeating this exercise for dif- 
ferent cells across the image one can determine L as a function of 
position (x,y) on the detector. This 2-D image of L values is the 
sensitivity map. Note that the sensitivity map is independent of the 
spectral shape of the source. 

A useful 1-D representation of this image, with a wide range 
of applications, is the total detector area in which a source with flux 
fx can be detected. The cumulative distribution of the area plotted 
as function of fx is often referred to as sensitivity curve. This was 
constructed adopting a Bayesian approach as follows. For a source 
with flux fx and a given spectral shape (F — 1.4 in this paper) we 
can determine the probability of detection in a cell with detection 
limit L and mean background B. The total observed counts in the 
cell are C — S + B, where S is the mean expected source con- 
tribution. C is given by equation [2] in the generic case of merged 
images from different EPIC detectors and/or XMM observations. 

Both B and S fluctuate and therefore using equation |7] the 
probability their sum exceeds the detection threshold is Pc.fx (— 
L) = 7(1/, C). The sensitivity curve is the sum of the Pcjx (— ^) 
distributions of individual detection cells. The sensitivity curve of 
the XMM/SDSS survey in different bands is plotted in Figure |4] 
That figure also shows that the total XMM/SDSS surveyed area 
is 122 deg^. In addition to the Bayesian approach we also esti- 
mate sensitivity curves adopting the standard methodology of as- 
signing a single limiting flux to a detection cell, i.e. assuming that 
the minimum net counts for a source to be detected in the cell are 
S = L — B. Figure |5] compares the Bayesian and standard sensi- 
tivity curves in different spectral bands and shows that they differ 
substantially at the faint flux end. 



9 OPTICAL IDENTIFICATIONS 



The Likelihood Ratio method (LR; [Sutherland & Saunders] 1 992| 
|Laird et al.|200 9) is adopted to identify X-ray sources with optical 
counterparts in the SDSS. In this exercise the quantities of inter- 
est are (i) the probability that an optical source, at a given distance 
from the X-ray position and with a given optical magnitude, is the 
true counterpart and (ii) the probability that the same source is a 
spurious alignment. The likelihood ratio is defined as the ratio be- 
tween the two probabilities 



LR: 



g(m) /(r) 
n(m) ' 



(9) 



where g(m) is the expected magnitude distribution of the true op- 
tical counterparts, /(r) is the probability distribution function of 
the positional uncertainties in both the X-ray and the optical source 
catalogues and n(m) is the background density of optical galaxies 
of magnitude m in the SDSS r-band. 

For the positional accuracy of the X-ray sources we adopt a 
Gaussian distribution with standard deviation of 1.5 arcsec (see be- 
low). The a priori probability qijm) that an X-ray source has a coun- 
terpart with magnitude m, is determined as follows. Firstly, the 
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Figure 4. Sensitivity curves of the XMM/SDSS survey in different spectral 
bands. The curves are red dashed: full-band; blue dotted: soft-band; black 
continuous: hard-band; green long-dashed: very-hard band; black dashed- 
dotted: ultra-hard band. The total surveyed are is 122 deg^. 
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Figure 5. Ratio between the standard and Bayesian sensitivity curve es- 
timates as a function of flux in different spectral bands. The curves are 
red dashed: full-band; blue dotted: soft-band; black continuous: hard-band; 
green long-dashed: very-hard band; balck dashed-dotted: ultra-hard band. 
At faint fluxes the standard and Bayesian approaches for determining the 
sensitivity curve differ substantially. 



X-ray and optical source catalogues are matched by simply find- 
ing the closest counterpart within a fixed search radius of 4 arcsec. 
The magnitude distribution of the spurious matches is estimated by 
scaling n(m) to the area of 4 arcsec radius within which we search 
for counterparts. This is then subtracted from the magnitude dis- 
tribution of the counterparts in the real catalogs to determine the 
magnitude distribution of the true associations, g(m). 
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Having estimated f[r) and g(m) we identify all possible 
counterparts to X-ray sources within a 4 arcsec radius and the LR 
of each one is determined using equation [9] We consider as coun- 
terparts those galaxies with LR above a certain limit. The choice 
of the cutoff in LR is a trade off between maximum number of 
counterparts and minimum spurious identification rate. In order to 
assess how secure an optical counterpart is we use the reliability 
parameter defined by Sutherla nd & SaundersH1992| 

LRi 



Reli = 



Ej LRj + (1 - Q) ' 



(10) 



where Reli is the reliability of the i optical counterpart of an X- 
ray source, the index j of the summation runs over all the possible 
counterparts within the search radius and Q is the fraction of X-ray 
sources with identifications to the magnitude limit of the optical 
survey. It can be shown that the sum of the reliabilities of individ- 
ual counterparts equals the expected number of true associations 
(Sutherland & Saunders 1992). Comparison of Ei R^li with the 
total number of counterparts with LR > LRnmit provides an es- 
timate of the spurious identification rate. By varying LRnmit one 
can minimise the number of false associations. We chose to use 
LRiimit = 1-5. At this cutoff 49 per cent (19431/39830) of the 
X-ray point sources have counterparts with an estimated spurious 
identification rate of about 7 per cent. Table[T]presents details on the 
total number of optical identifications for different spectral bands as 
well as the number of sources in subsmaples defined by applying 
various selection criteria, e.g. magnitude cuts, spectroscopic red- 
shift availability etc. 

The optical identification fraction for the full band is plotted 
as function of 0.5-lOkeV flux limit in Figure|6] The optical iden- 
tification fraction increases to about 90 per cent at bright fluxes, 
fx ~ 10~^^ erg cm~^. The magnitude distribution of the op- 
tical counterparts of the XMM/SDSS sources is shown in Figure|7] 
The distribution of X-ray sources in the X-ray to optical flux ratio 
{fx I fo-pt) plane is shown in Figurejs] A total of 2067 X-ray point 
sources have spectroscopic observations from the SDSS. The red- 
shift histogram is shown in Figure |9] For X-ray sources associated 
with galaxies (i.e. optically resolved) brighter than r < 17.77 mag 
(the magnitude cut of the SDSS main galaxy spectroscopic sam- 
ple) the redshift sampling rate is about 90 per cent, i.e. similar to 
that of the overall SDSS galaxy population with r < 17.77 mag. 
The majority of spectroscopically unidentified X-ray sources that 
are brighter than this limit are associated with unresolved optical 
sources, i.e. they are most likely distant QSOs or Galactic stars. 

Figure [T0| shows that the distributions of the RA and Dec an- 
gular separations between the positions of X-ray sources and their 
optical counterparts {LR > 1.5) can be approximated with Gaus- 
sians with FWHM of about 1 arcsec. The overall astrometric ac- 
curacy of the X-ray catalogue is therefore about 1.5 arcsec. This 
justifies the use of a normal distribution with a standard deviation 
of 1.5 arcsec for f(r) in equation[9] 

The XMM/SDSS source catalogue is made available to the 
publicQ in fits format. The information included in that file is listed 
in Appendix [B] 



10 THE COLOUR-MAGNITUDE DIAGRAM 

One of the main motivations for the XMM/SDSS serendipitous sur- 
vey is the compilation of a low redshift X-ray selected AGN sam- 
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Figure 6. Distribution of the fraction of full-band selected X-ray point 
sources with optical counterparts as a function of O.S-lOkeV flux limit. 
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Figure 7. SDSS r-band magnitude distribution of full band selected X-ray 
sources in the XMM/SDSS survey. 



pie which can be compared with high redshift AGN identified in 
deep X-ray surveys. In this section we present preliminary results 
on the comparison of the colour-magnitude diagram (CMD) of X- 
ray AGN at ^ ^ 0.1 and z 0.8. 

The low redshift X-ray sample consists of 293 hard-band (2- 
8keV) detections in the XMM/SDSS survey with 0.03 < z < 
0.2 and r < 17.77 mag after correcting for Galactic extinction 
l |Schlegel et al.|1998) . The magnitude cut corresponds to the limit 
of the SDSS Main Galaxy Sample (r < 17.77 mag Strauss et aT] 
|2002[ l, which provides the vast majority of redshifts in the SDSS. 
The photometry is from the New York University Value-Added 
Galaxy Catalog (NYU-V AGC [Blanton et al.|[2005l > which corre- 
sponds to the SDSS DR7 ( |Abazajian et al.|2009p rhis is because 
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Table 1. X-ray point source catalogue details 



Sample 


total number 


LR > 1.5 


spectioscopy 


r < 17. 77 mag 


r < 17.77mag AND 


r < 17.77magAND 




of sources 








spec-z 


0.03 < Zspcc < 0.2 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


full band 


35727 


18246 


1949 


1920 


844 


560 


soft band 


33162 


17631 


1943 


2018 


840 


552 


hard band 


14733 


8803 


1237 


728 


496 


298 


vhrd band 


2245 


1722 


544 


375 


297 


174 


uhrd band 


258 


232 


143 


140 


111 


58 



The columns are: (1): X-ray sample; (2): total number of X-ray point sources; (3) sources with optical counterpart (LR > 1.5); (4) X-ray sources with LR > 1.5 and SDSS spectroscopy (5) X-ray 
sources with optical counterparts brighter than r — 17. 77 mag {petrosian magnitude, AB system) after correcting for Galactic extinction; (6) X-ray sources with r < 17.77 mag and spectroscopic redshift 
measurement; (7) X-ray sources with r < 17.77 mag and redshifts in the range 0.03 — 0.2, i.e. the selection criteria used to construct the colour magnitude diagram of section 9. Note however, that in section 
9 we use the photometry from the New York University Value-Added Galaxy Catalog (Blanton et al. 2005) and therefore there lue small differences compared in the sample size compared to the numbers listed 
in column (7). 
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Figure 8. r-band magnitude against 0.5-lOkeV flux for the XMM/SDSS 
X-ray sources with optical counterparts. The diagonal lines correspond to 

log /x//opt = ±1. 



the NYU-VAGC provides better photometric calibration of the 
SDSS data ( [Padmanabhan et al.|2008[ l compared to the DR7 and an 
accurate description of the SDSS window function. A small number 
of XMM/SDSS sources with 0.03 < z < 0.2 and r < 17.77 mag 
(total of 5, see column 7 of Table[TJ do not have photometric mea- 
surements in the NYU-VAGC and have been excluded from the 
final sample. Visual inspection shows that all of them are close to 
the edge of the SDSS field of view. The choice of the 2-8 keV X-ray 
band is because it is less affected by obscuration biases compared to 
softer energies and the XMM's sensitivity remains high in that band 
thereby resulting in a sample that is sufficiently large for statistical 
studies. The CMD of the overall galaxy population at z ~ 0.1 is 
defined by selecting a total of 13,619 NYU-VAGC sources with 
r < 17.77 mag that overlap with the XMM/SDSS survey fields 
and have spectroscopic redshifts in the interval 0.03 < z < 0.2. 

The KCORRECT version 4.2 routines developed by [Blanton &| 
|Roweis| ( |2007p are used to fit spectral models to the NYU-VAGC 
ugriz photometry and then estimate rest-frame magnitudes (AB 
system) in the °'^u, ^''^g bands, which are the SDSS it, g filters 
shifted to z — 0.1. The advantage of this choice of bandpasses is 
twofold. They are close to rest-frame for our z ~ 0.1 XMM/SDSS 



Figure 9. Redshift distribution of the XMM/SDSS X-ray sources in loga- 
rithmic bins of size A log z = 0.1. 



sample and also have effective wavelengths that are similar to the 
rest-frame effective wavelengths of the DEEP2 R and I filters at 
z = 1 ( |Blanton|2006) . 

The high redshift CMD is constructed using data from the 
AEGIS ( [Davis et al.|2007) . The spectroscopic sample consists of 
6797 galaxies observed as part of the DEEP2 survey (Data Release 
3) with secure redshift determinations (> 90% confidence level, 
quality flag Q > 3; |Davis et al. 2007 ) in the interval 0.6 < z < 1.2. 
The X-ray data in the AEGIS are from the Chandra survey of the 
Extended Groth Strip (AEGIS-X survey. Laird et al. 2009). The 
X-ray sources are identified with DEEP2 galaxies using the Likeli- 
hood Ratio methodology described in Laird et al. (2009). The spec- 
troscopy for X-ray sources is from a variety of sources: the DEEP2 
redshift survey, the SDSS and a number of spectroscopic programs 
that have targeted galaxies in the original Groth Strip ( [Weiner et al.[ 
[2005[ >. We select a total of 115 X-ray sources in the spectroscopic 
redshift interval 0.6 < z < 1.2 detected in the 0.5-7 keV band of 
the AEGIS-X survey. The median redshift of the sample is 0.8. The 
choice of the X-ray band for the detection of sources is to ensure 
that our z ~ 0.8 sample is selected at similar rest-frame energies 
(l-14keV) as the 2: « 0.1 sample (2-8 keV), thereby facilitating the 
comparison. For the estimation of rest-frame quantities we use the 
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Figure 10. Angulai' separation distribution between the X-ray sources and 
their optical counterparts in the SDSS (LR > 1.5). The top and bottom 
panels show the Dec and RA offset histograms respectively. The curves are 
Gaussians with FWHM of about 1 arcsec and provide a fair approximation 
of the observed distributions. 



BRI photometry obtained as part of tlie DEEP2 survey ( |CoiI et aL] 
|2004[ l. Tlie KCORRECT routines are used to fit models to the BRI 
data and then estimate the rest-frame "'^u and '^'^(j-band magni- 
tudes from the observed DEEP2 R and J-band photometry respec- 
tively. 

Figure [TT] compares the absorption corrected 2-lOkeV X-ray 
luminosity and the hydrogen column density (Nh) distributions of 
X-ray sources in the AEGIS sample and the low redshift subset 
of the XMM/SDSS survey. The Nh of individual sources are de- 
termined from the hardness ratios between the soft (0.5-2 keV for 
both AEGIS and XMM/SDSS) and the hard (2-7 keV for AEGIS, 
2-8 keV for XMM) X-ray bands assuming an intrinsic power-law 
X-ray spectrum with index V — 1.9 (e.g. [Nandra & Pounds] 1994) 1. 
The derived column densities are then used to correct the X-ray 
luminosities for photoelectric absorption. The XMM/SDSS sam- 
ple at 2; ~ 0.1 is sensitive to sources with Lx(2 — lOkeV) > 
10*" erg s~^. In contrast the AEGIS detects X-ray sources brighter 
than logLx(2 - lOkeV) > 41.5 (ergs~^) at z ^ 0.8. To avoid 
luminosity dependent biases in the comparison of the CMDs of 
X-ray sources in the AEGIS and the low redshift subset of the 
XMM/SDSS we limit both samples to logLx(2 - 10 ,keV) > 
41.5 (erg s~^). X-ray sources with luminosities < 10'*^ ergs~^ are 
often excluded from AGN studies because normal galaxies are be- 
lieved to be an important source of contamination below this limit. 
Although normal galaxy candidates are present at faint X-ray lu- 
minosities, AGN remain the dominant population. In |Georgakakis| 
|et al.| ( [2b07| l for example. X-ray detected starburst galaxy candi- 
dates represent only about 20 per cent of the GOODS-North X-ray 
source population in the luminosity interval 10*" — 10*^ ergs~^. 
This fraction is also likely to be a upper limit as some of the galaxy 
candidates may turn out to be low luminosity AGN. Furthermore, 
Georgakakis (20081 cross-correlated the 2XMMp catalogue ( |Wat-| 
son et al.j,2009^ with the SDSS spectroscopic database to select 
normal galaxies at z « 0.1. Such sources were found to repre- 
sent 20 per cent of the overall X-ray population for luminosities 



10 — 10 erg s~ . This fraction is also likely to be a strong func- 
tion of luminosity within that interval. Contamination by non-AGN 
is therefore not a concern for the XMM/SDSS and the AEGIS X- 
ray source samples with log Lx (2 — 10 , keV) > 41.5 (ergs^^). 
This cut reduces the total number of low redshift (0.03 < 2 < 0.2, 
r < 17.77 mag) XMM/SDSS X-ray sources from 293 to 209. The 
sample includes a total of 16 XMM targets, i.e. non serendipitous 
X-ray sources, most of which (13/16) are type-1 AGN with broad 
optical emission lines. We choose to keep them in the sample as 
they are not affecting the main results and conclusions. 

■°-'^{u-g) vs A/o.ig CMD of galaxies 



Figure 
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; compares the 

and X-ray sources in the AEGIS and the low redshift XMM/SDSS 
survey sample. X-ray sources which either show broad emission 
lines in their optical spectra and/or a prominent nuclear point 
source in their broadband optical images (SDSS or HST/ACS sur- 
vey of the AEGIS) are marked in Figure[T2] The optical continuum 
of those sources is significantly contaminated by AGN light. Their 
colours do not provide information on their host galaxies and are 
therefore excluded from further analysis. As expected the optical 
colours of these sources are very blue. The °-^{u - g) distribution 
of the two samples are shown in Figure [13] which better demon- 
strates the bimodality of the galaxy colours in both the AEGIS 
and the SDSS. We set the separation between red sequence and 
blue cloud galaxies at °-'^{u - 5) = 1.4 for both the AEGIS at 
z ^ 0.8 and the SDSS at 2: ^ 0.1. Although the dividing line be- 
tween red and blue galaxies also depends on absolute magnitude 
(e.g. [Bell et aiT2004, .Willmer et al.,2006.) , for our purposes it is 
sufficient to apply a simple colour cut (e.g. Blanton 2007). X-ray 
sources with colours contaminated by AGN light are not plotted 
in Figure [13] It is interesting that X-ray AGN in both the AEGIS 
and the low redshift subset of the SDSS/XMM survey have similar 
rest-frame colour distributions. The Kolmogorov-Smimov test es- 
timates a probability of 50 per cent that the two samples are drawn 
from the same parent population. Also, the fraction of X-ray AGN 
with "'^ (it — g) > 1.4 is 50 ± 4 in the low redshift subset of the 
XMM/SDSS and 56 ± 5 in the AEGIS, where the errors are esti- 
mated using binomial statistics. 

Before interpreting those fractions however, the differences in 
the selection function of the two samples need to be accounted for. 
X-ray AGN in the AEGIS include, on average, a higher fraction of 
AGN with high hydrogen column density, Nh > 10^^ cm^^ (see 
Figure right panel). The AGN light of those obscured AGN is 
less likely to affect the host galaxy colours. We account for these 
effects using the simulations described in the appendix |C] to ex- 
plore how the rest-frame colours of the AEGIS X-ray sources at 
2 ~ 0.8 would appear in the low redshift (0.03 < z < 0.2) sub- 
sample of the XMM/SDSS survey. In brief, AEGIS X-ray sources 
are assigned a random redshift in the range 0.03 — 0.2. Their X- 
ray flux (2-lOkeV band) and apparent magnitude (SDSS r-band) 
are estimated at this new redshift to determine if they fulfill the se- 
lection criteria (e.g. X-ray sensitivity curve, r < 17.77 mag) of the 
low redshift subsample of the XMM/SDSS survey. We can then ex- 
plore how the various redshift dependent selection effects modify 
the derived rest-frame colour distribution and if they alter the frac- 
tion of AGN in red hosts. The simulated distribution of "'^(it — g) 
colours is plotted in Figure[T4] It is found that the fraction of X-ray 
sources with "'^ (u — g) > 1.4 is 55 per cent, i.e. the same as for the 
~ 0.8 X-ray sources. Differential selection effects are not impor- 
tant for the comparison of the CMD of X-ray sources in the AEGIS 
and the XMM/SDSS survey. We therefore conclude that the data 
are consistent with no evolution of the rest-frame colours of X-ray 
AGN from 2 = 0.8 to 2 = 0.1. 
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Figure 11. Left panel: 2-lOkeV X-ray luminosity distribution of X-ray sources in the AEGIS (top) and the low redshift subset of the XMM/SDSS (bottom) 
samples. The X-ray luminosities are corrected for photoelectric absorption. In the case of AEGIS the 2-lOkeV luminosity is estimated from the observed 
0.5-7 keV count rate. The XMM/SDSS sample at 0.03 < z < 0.2 reaches luminosities as faint as 10'"' erg s~^, while AEGIS at 0.6 < 2 < 1.2 probes more 
luminous X-ray sources. Right panel: Histograms of the absorbing hydrogen column density, Nu in the AEGIS (top) and z ^ 0.1 XMM/SDSS (bottom) 
samples. This is determined from the observed hardness ratio of individual sources. For the AEGIS the hardness ratio is estimated from the count rates in the 
Chandra 0.5-2 and 2-7 keV spectral bands. In the case of the XMM/SDSS sample the hardness ratio is defined between the XMM's 0.5-2 and 2-8 keV bands. 




Figure 12. '^■^{u — g) vs Ma.ig Colour magnitude diagram of galaxies (blue dots) and X-ray AGN (red circles) in the XMM/SDSS low redshift sample 
(0.03 < 2 < 0.2; left panel) and the AEGIS (0.6 < z < 1.3; right panel). Crosses indicate X-ray sources with rest-frame colours contaminated by AGN 
light. 



11 DISCUSSION AND CONCLUSIONS 

We present a new serendipitous X-ray survey, the XMM/SDSS, 
which is based on archival XMM observations and covers 122 deg^ 
in the SDSS DR7 footprint. The size and sensitivity of this sur- 
vey are well suited for low redshift X-ray AGN studies. The 
XMM/SDSS source catalogue is combined with the SDSS opti- 
cal spectroscopy to compile one of the largest hard X-ray (2-8 keV) 
AGN samples to date in the redshift interval 0.03 < z < 0.2. The 



selection function of this sample is well defined and similar to that 
of deep, pencil-beam X-ray surveys, which probe AGN at moderate 
and high redshift. This greatly facilitates the study of the properties 
of AGN across redshift to shed light on the physical mechanism(s) 
responsible for the rapid decline of the accretion power of the Uni- 
verse from z ~ 1 to the present day. Variations with redshift of the 
dominant SMBH fueling mode, which in some scenarios drive the 
evolution of the AGN population (e.g. |Hopkins & Hemquist|2006[ 
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Figure 13. "'^(ji — g) colour distribution of galaxies (blue hatched his- 
togram) and X-ray AGN (red solid histogram) in the XMM/SDSS low red- 
shift sample (0.03 < z < 0.2; lower panel) and the AEGIS (0.6 < z < 
1.3; upper panel). All histograms are normalised by the total number of 
sources in each sample. 
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main AGN fueling mode at early epochs, while hot gas accretion 
becomes increasingly more important at later times. 

Systematic variations with redshift of the rest-frame colours 
of AGN hosts provide a first order test to the scenarios above. In 
the Fanidakis et al. (2010) prescription for example, the AGN pop- 
ulation should shift to redder hosts with decreasing redshift. The 
opposite trend might be expected if disk instabilities dominate the 
accretion power at later epochs. Although a number of recent stud- 
ies explored the position of AGN hosts on the colour-magnitude 
diagram (e.g. |Hickox et al.|2009[|Cardamone et al.|2010[|Xue et al.| 
|2010[ l, none of them has investigated possible evolutionary effects 
with redshift, as it is done in this paper. We find no evidence for 
evolution of the CMD of X-ray AGN hosts from 2 0.8 (AEGIS- 
X) to z ^ 0.1 (XMM/SDSS). Apart from changes which are con- 
sistent with the evolution of the overall galaxy population between 
those redshifts. X-ray AGN are found in similar host galaxies, in 
terms of integrated colours, at both 2; = 0.8 and z = 0.1. This 
finding does not support the scenario in which the dominant mode 
of accretion changes with redshift since z = 0.8. 

It should be noted however, that a number of biases may affect 
this conclusion. Although we have removed sources with colours 
dominated by the central engine (i.e. broad optical emission lines, 
bright nuclear point sources), we cannot exclude the possibility 
of residual contamination by scattered AGN light. Nevertheless, 
[Pierce et al.| ( |2010l > studied the HST colour profiles of AGN hosts 
in the AEGIS survey. They concluded that the light from the ac- 
tive nucleus does not have a strong effect on the integrated colours 
of the majority of the AGN in their sample. Recenty, |Cardamonel 
|et al.|p010^ suggest that about 25 per cent of the red cloud X-ray 
AGN at z « 1 are dusty star-forming systems, not evolved galax- 
ies with a dominant old stellar population component. Systematic 
variations with redshift of the fraction of dust-reddened AGN hosts 
would impact our results and conclusions. 

A striking feature of Figure [12] is the large fraction of blue 
type-1 AGN among low redshift X-ray sources compared to the 
AEGIS-X survey. The low redshift XMM/SDSS subsample inl- 
cudes XMM targets, which in their majority are broad line AGN 
(13/16). Also, the SDSS spectroscopic targets inlcude apart from 
galaxies, QSOs at both low and high redshift, ROSAT X-ray 
sources, FIRST radio galaxies. The SDSS therefore has a higher 
spectroscopic sampling rate for type-1 AGN compared to the 
DEEP2 spectroscopic survey of the AEGIS. We conclude that se- 
lection effects are primarily responsible for the difference in the 
relative fraction of type-1 AGN begtween the AEGIS-X and the 
low redshift subset of the XMM/SDSS survey. 



Figure 14. The top panel shows the ^ ( it — g) colour distribution of X-ray 
AGN at 0.6 < z < 1.2 in the AEGIS field. The middle panel shows the 
colour distribution of that sample as it would appear in the redshift interval 
0.03 < 2 < 0.2 in our XMM/SDSS survey. For comparison the bottom 
panel plots the colour distribution of X-ray AGN in the range 0.03 < z < 
0.2 from the XMM/SDSS survey. 

|Hasinger|2008[ |Fanidakis et al.|2010[ l, are expected to imprint on 
the properties of AGN, including those of their host galaxies. It has 
been proposed for example that major mergers dominate the AGN 
population at high redshift and bright luminosities, while stochas- 
tic accretion onto the SMBH of disk galaxies become important at 
low redshift and luminosities ( [Hopkins & Hemquist|2006l l. Alter- 
natively, Fanidakis et al. (2010) argue that disk instabilities are the 
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APPENDIX A: POINT SPREAD FUNCTION 

The XMM Science Simulator (SciSiM) version 4.0.4 was used 
to produce point sources with monochromatic flux of 2 x 
10~® photons/mm/s at 1, 2, 3.5, 6 and 8.5 keV, i.e. the mean en- 
ergies of the soft, full, hard, uhrd and vhrd bands respectively. The 
simulated sources were placed in a grid of positions separated by 
90arcsec on the PN, MOSl and M0S2 cameras. A denser grid, 
although desirable, would be computationally expensive. Also, 
no simulations were produced for positions that are closer than 
21 arcsec radius from CCD gaps, as in those cases the PSF size 
and shape cannot be reliably determined. The total exposure time 
was set to 50 ks. Because of technical issues related to the SciSiM 
code, this was spUt into 5 separate 10 ks simulations, which were 
then merged into a single event file. The source photon flux and 
the integration time were chosen to produce sufficient number of 
counts on the EPIC cameras to determine the PSF properties. Each 
simulation consists of a single energy, position and detector. There 
are 324, 361 and 380 PSFs at each energy for the PN, MOSl and 
MOS2 cameras respectively. 

The SciSiM generated ODE level files which were processed 
with SAS to produce event files, images and exposure maps for 
each simulation. The source counts were fit with ellipses using the 
IRAF (Image Reduction and Analysis Facility) task ellipse of the 
STSDAS package, which was designed to fit the isophotes of opti- 
cal galaxies. The task reads 2-dimensional images and fits ellipses 
to the isophotes at predefined semi-major axis values. During the 
ellipse fitting process the centre of the source was kept fixed. The 
position angle, PA, and ellipticity, e, were left free parameters up to 
the semi-major axis value where S/N constraints terminated the fit- 
ting. Beyond that radius the position angle and ellipticity were kept 
constant. Parts of the image with exposure time equal to zero (e.g. 
CCD gaps) were masked out diu-ing the fitting process. The out- 
put of the task is a table that includes, among other parameters, the 
best fit ellipticity and position angle at different semi-major axis 
values. We used those parameters to estimate the PSF profile by 
extracting counts within elliptical annuli. The fraction of the pix- 
els of the annuli that fall either outside the EPIC field of view or 
within CCD gaps is accounted for in the calculations. The cumula- 
tive radial distribution of the corrected counts is constructed, which 
is then used to estimate by linear interpolation the semi major axis 
radius, e and PA of the 60, 70 and 80 per cent EEF of the PSF at dif- 
ferent monochromatic energies and positions on the EPIC CCDs. 



APPENDIX B: TABLE COLUMNS 
Bl source detection parameters 

UXID (30A): Unique X-ray source identification string. The unique 
name of the each source is a string which is composed from the 
project name, followed by the number of the pointing on which the 
source is detected or zero in the case of projects with a single XMM 
observation and ending with a sequential number. 
BOX_ID_SRC (J): identification number of a source on individual 
pointings or observations in the case of projects with a single XMM 
observation. 

RA (D): Right Ascension in degrees of the X-ray source before 
correcting for systematic offsets. 

DEC (D): DecUnation in degrees of the X-ray source before cor- 
recting for systematic offsets. 

RADECJERR (E): Positional imcertainty in arcsec of the source 



position. This parameter is estimated by the EWAVELET task of 
SAS. 

CNT_TOTAL_FULL_70 (J): full band counts at the source position 
from all EPIC cameras within the 70 per cent EEF ellipse. 
BKG_TOTAL_FULL_70 (E): expected number of full band back- 
ground counts at the source position from all EPIC cameras within 
the 70 per cent EEF ellipse. 

EXP_TOTALJTJLL_70 (E): full band exposure time in seconds at 
the source position from all EPIC cameras. This parameter is the 
average exposure time within the 70 per cent EEF ellipse. 
PROB_FULL (E): Poisson probabiUty that the source is spurious in 
the full band. 

CNT_PN_FULL_70 (J): full band counts at the source position on 
the PN camera within the 70 per cent EEF ellipse. 
BKG_PN_FULL_70 (E): expected number of full band background 
counts at the source position on the PN camera within the 70 per 
cent EEF ellipse. 

EXP_PN_FULL_70 (E): PN camera exposure time in seconds at the 
source position. This parameter is the average exposure time within 
the 70 per cent EEF ellipse. 

CNT_M1_FULL_70 (J): full band counts at the source position on 
the MOSl camera within the 70 per cent EEF ellipse. 
BKGJVI1_FULL_70 (E): expected number of full band background 
counts at the source position on the MOSl camera within the 70 
per cent EEF ellipse. 

EXP_M1_F'ULL_70 (E): MOSl camera exposure time in seconds 
at the source position. This parameter is the average exposure time 
within the 70 per cent EEF ellipse. 

CNT_M2_FULL_70 (J): full band counts at the source position on 
the MOS2 camera within the 70 per cent EEF ellipse. 

BKG_M2_FULL_70 (E): expected number of full band background 
counts at the source position on the MOS2 camera within the 70 
per cent EEF ellipse. 

EXP_M2_FULL_70 (E): M0S2 camera exposure time in seconds 
at the source position. This parameter is the average exposure time 
within the 70 per cent EEF ellipse. 

EMLT)ETJV1L_FULL (E): EMLDETECT detection likelihood in 
the full band for all EPIC cameras. 

EML_EXT_ML_FULL (E): Likelihood that the source is extended 
in the full band. This parameter is estimated by the emldetect 
task of SAS for all EPIC cameras. 

CNT_TOTAL_SOFT_70 (J): soft band counts at the source position 
from all EPIC cameras within the 70 per cent EEF eUipse. 
BKG_TOTAL_SOFT_70 (E): expected number of soft band back- 
ground counts at the source position from all EPIC cameras within 
the 70 per cent EEF ellipse. 

EXP_TOTAL_SOFT_70 (E): soft band exposure time in seconds at 
the source position from all EPIC cameras. This parameter is the 

average exposure time within the 70 per cent EEF ellipse. 
PROB_SOFT (E): Poisson probability that the source is spurious in 
the soft band. 

CNT_PN_SOFT_70 (J): soft band counts at the source position on 
the PN camera within the 70 per cent EEF ellipse. 

BKG_PN_SOFT_70 (E): expected number of soft band background 
counts at the source position on the PN camera within the 70 per 
cent EEF ellipse. 

EXP_PN_SOFT_70 (E): PN camera soft-band exposure time in sec- 
onds at the source position. This parameter is the average exposure 
time within the 70 per cent EEF ellipse. 

CNT_M1_SOFT_70 (J): soft band counts at the source position on 
the MOSl camera within the 70 per cent EEF ellipse. 
BKGJM1_SOFT_70 (E): expected number of soft band background 
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counts at the source position on tlie MOSl camera within the 70 per 
cent EEF ellipse. 

EXPJVI1_SOFT_70 (E): MOSl camera soft-band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 70 per cent EEF ellipse. 
CNT_M2_SOFT_70 (J): soft band counts at the source position on 
the MOS2 camera within the 70 per cent EEF ellipse. 
BKG JVI2_SOFT_70 (E): expected number of soft band background 
counts at the source position on the M0S2 camera within the 70 per 
cent EEF ellipse. 

EXPjy[2_SOFT_70 (E): MOS2 camera soft-band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 70 per cent EEF ellipse. 
EML_DETJVIL_SOFT (E): EMLDETECT detection likelihood in 
the soft band for all EPIC cameras. 

EML_EXT_ML_SOFT (E): Likelihood that the source is extended 
in the soft band. This parameter is estimated by the emldetect 
task of SAS for all EPIC cameras. 

CNT_TOTAL_HARD_70 (J): hard band counts at the source posi- 
tion from all EPIC cameras within the 70 per cent EEF ellipse. 
BKG_TOTAL_HARD_70 (E): expected number of hard band back- 
ground counts at the source position from all EPIC cameras within 
the 70 per cent EEF ellipse. 

EXP_TOTAL_HARD_70 (E): hard band exposure time in seconds 
at the source position from all EPIC cameras. This parameter is the 
average exposure time within the 70 per cent EEF elUpse. 
PROB_HARD (E): Poisson probability that the source is spurious 
in the hard band. 

CNT_PN_HARD_70 (J): hard band counts at the source position on 
the PN camera within the 70 per cent EEF ellipse. 

BKG_PN_HARD_70 (E): expected number of hard band back- 
ground counts at the source position on the PN camera within the 
70 per cent EEF ellipse. 

EXP_PN_HARD_70 (E): PN camera hard-band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 70 per cent EEF ellipse. 
CNTjyil_HARD_70 (J): hard band counts at the source position on 
the MOSl camera within the 70 per cent EEF ellipse. 
BKGJVI1_HARD_70 (E): expected number of hard band back- 
ground counts at the source position on the MOSl camera within 
the 70 per cent EEF ellipse. 

EXPjyil_HARD_70 (E): MOSl camera hard-band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF elUpse. 
CNT_M2_HARD_70 (J): hard band counts at the source position on 
the MOS2 camera within the 70 per cent EEF ellipse. 
BKGJV12_HARD_70 (E): expected number of hard band back- 
ground counts at the source position on the MOS2 camera within 
the 70 per cent EEF ellipse. 

EXP_M2_HARD_70 (E): M0S2 camera hard-band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF elUpse. 
EMLJ3ETJVIL_HARD (E): EMLDETECT detection likelihood in 
the hard band for all EPIC cameras. 

EML_EXT_ML_HARD (E): Likelihood that the source is extended 
in the hard band. This parameter is estimated by the emldetect 
task of SAS for all EPIC cameras. 

CNT_TOTAL_VHRD_70 (J): very hard band counts at the source 
position from all EPIC cameras within the 70 per cent EEF ellipse. 
BKG_TOTAL_VHRD_70 (E): expected number of very hard band 
background counts at the source position from all EPIC cameras 
within the 70 per cent EEF ellipse. 



EXP_TOTAL_VHRD_70 (E): very hard band exposure time in sec- 
onds at the source position from all EPIC cameras. This parameter 
is the average exposure time within the 70 per cent EEF ellipse. 
PROB_VHRD (E): Poisson probability that the source is spurious 
in the very hard band. 

CNT_PN_VHRD_70 (J): very hard band counts at the source posi- 
tion on the PN camera within the 70 per cent EEF ellipse. 
BKG J'N_VHRD_70 (E): expected number of very hard band back- 
ground counts at the source position on the PN camera within the 
70 per cent EEF ellipse. 

EXP_PN_VHRD_70 (E): PN camera very hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
CNT_M1_VHRD_70 (J): very hard band counts at the source posi- 
tion on the MOSl camera within the 70 per cent EEF ellipse. 
BKG_M1_VHRD_70 (E): expected number of very hard band back- 
ground counts at the source position on the MOSl camera within 
the 70 per cent EEF ellipse. 

EXPjyil_VHRD_70 (E): MOSl camera very hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
CNT_M2_VHRD_70 (J): very hard band counts at the source posi- 
tion on the MOS2 camera within the 70 per cent EEF ellipse. 
BKGJvI2_VHRD_70 (E): expected number of very hard band back- 
ground counts at the source position on the MOS2 camera within 
the 70 per cent EEF ellipse. 

EXP_M2_VHRD_70 (E): M0S2 camera very hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
EML_DET_ML_VHRD (E): EMLDETECT detection Ukelihood in 

the very hard band for all EPIC cameras. 

EML_EXTJVIL_VHRD (E): Likelihood that the source is extended 
in the very hard band. This parameter is estimated by the emlde- 
tect task of SAS for all EPIC cameras. 

CNT_TOTAL_UHRD_70 (J): ultra hard band counts at the source 
position from all EPIC cameras within the 70 per cent EEF ellipse. 
BKG_TOTAL_UHRD_70 (E): expected number of ultra hard band 
background counts at the source position from all EPIC cameras 
within the 70 per cent EEF ellipse. 

EXP_TOTAL_UHRD_70 (E): ultra hard band exposure time in sec- 
onds at the source position from all EPIC cameras. This parameter 
is the average exposure time within the 70 per cent EEF ellipse. 
PROB UHRD (E): Poisson probability that the source is spurious 
in the ultra hard band. 

CNT_PN_UHRD_70 (J): ultra hard band counts at the source posi- 
tion on the PN camera within the 70 per cent EEF ellipse. 
BKG_PN_UHRD_70 (E): expected number of ultra hard band back- 
ground counts at the source position on the PN camera within the 

70 per cent EEF ellipse. 

EXP_PN_UHRD_70 (E): PN camera ultra hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
CNTJVI1_UHRD_70 (J): ultra hard band counts at the source posi- 
tion on the MOSl camera within the 70 per cent EEF ellipse. 
BKGjyil_UHRD_70 (E): expected number of ultra hard band 
background counts at the source position on the MOSl camera 
within the 70 per cent EEF ellipse. 

EXP_M1_UHRD_70 (E): MOSl camera ultra hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
CNT_M2_UHRD_70 (J): ultra hard band counts at the source posi- 
tion on the MOS2 camera within the 70 per cent EEF ellipse. 
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BKGJVI2_UHRD_70 (E): expected number of ultra hard band 
background counts at the source position on the M0S2 camera 
within the 70 per cent EEF ellipse. 

EXP_M2_UHRD_70 (E): M0S2 camera ultra hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 70 per cent EEF ellipse. 
EML_DET_ML_UHRD (E): EMLDETECT detection hkelihood in 
the ultra hard band for all EPIC cameras. 

EML_EXT_ML_UHRD (E): Likelihood that the source is extended 
in the ultra hard band. This parameter is estimated by the EMLDE- 
TECT task of SAS for all EPIC cameras. 

RA_CORR (D): X-ray source Right Ascension in degrees corrected 
for systematic offsets of the XMM pointing. 
DEC_CORR (D): X-ray source Declination in degrees corrected for 
systematic offsets of the XMM pointing. 

LILCORR (D): X-ray source Galactic longitude in degrees corre- 
sponding to RA CORR and DEC CORR 

BILCORR (D) X-ray source Galactic latitude in degrees corre- 
sponding to RA_CORR and DEC_CORR 

B2 Source parameters related to flux estimation 

CNT_TOTAL_FULL_80 (J): full band counts at the source position 
from all EPIC cameras estimated within the 80 per cent EEF el- 
lipse. 

BKG_TOTAL_FULL_80 (E): expected number of full band back- 
ground counts at the source position from all EPIC cameras within 
the 80 per cent EEF ellipse. 

EXP_TOTAL_FULL_80 (E): full band exposure time in seconds at 
the source position from all EPIC cameras. This parameter is the 
average exposure time within the 80 per cent EEF ellipse. 
CNT_PN_FULL_80 (I): full band counts at the source position on 
the PN camera within the 80 per cent EEF ellipse. 
BKG_PN_FULL_80 (E): expected number of full band background 
counts at the source position on the PN camera within the 80 per 
cent EEF ellipse. 

EXP_PN_FULL_80 (E): PN camera full band exposure time in sec- 
onds at the source position. This parameter is the average exposure 
time within the 80 per cent EEF ellipse. 

CNTJVI1_FULL_80 (J): full band counts at the source position on 
the MOSl camera within the 80 per cent EEF ellipse. 
BKG_M1_FULL_80 (E): expected number of full band background 
counts at the source position on the MOSl camera within the 80 
per cent EEF ellipse. 

EXP_M1_FULL_80 (E): MOSl camera full band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 80 per cent EEF ellipse. 
CNTJVI2 JULL_80 (J): full band counts at the source position on 
the M0S2 camera within the 80 per cent EEF ellipse. 
BKG JV12 JULL_80 (E): expected number of full band background 
counts at the source position on the M0S2 camera within the 80 
per cent EEF elUpse. 

EXPJVI2 JULL_80 (E): M0S2 camera full band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 80 per cent EEF ellipse. 
CNT_TOTAL_SOFT_80 (J): soft band counts at the source posi- 
tion from all EPIC cameras estimated within the 80 per cent EEF 
ellipse. 

BKG_TOTAL_SOFT_80 (E): expected number of soft band back- 
ground counts at the source position from all EPIC cameras within 
the 80 per cent EEF ellipse. 

EXP_TOTAL_SOFT_80 (E): soft band exposure time in seconds at 



the source position from all EPIC cameras. This parameter is the 
average exposure time within the 80 per cent EEF ellipse. 
CNT_PN_SOFT_80 (J): soft band counts at the source position on 
the PN camera within the 80 per cent EEF ellipse. 
BKG_PN_SOFT_80 (E): expected number of soft band background 
counts at the source position on the PN camera within the 80 per 
cent EEF elUpse. 

EXP_PN_SOFT_80 (E): PN camera soft band exposure time in sec- 
onds at the source position. This parameter is the average exposure 
time within the 80 per cent EEF ellipse. 

CNT_M1_SOFT_80 (J): soft band counts at the source position on 
the MOSl camera within the 80 per cent EEF ellipse. 

BKG_M1_SOFT_80 (E): expected number of soft band background 
counts at the source position on the MOS 1 camera within the 80 per 
cent EEF ellipse. 

EXP_M1_SOFT_80 (E): MOSl camera soft band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 80 per cent EEF ellipse. 
CNTjyi2_SOFT_80 (J): soft band counts at the source position on 
the M0S2 camera within the 80 per cent EEF ellipse. 
BKG_M2_SOFT_80 (E): expected number of soft band background 
counts at the source position on the M0S2 camera within the 80 per 
cent EEF ellipse. 

EXPjy[2_SOFT_80 (E): MOS2 camera soft band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 80 per cent EEF ellipse. 
CNT_TOTAL_HARD_80 (J): hard band counts at the source posi- 
tion from all EPIC cameras estimated within the 80 per cent EEF 
ellipse. 

BKG_TOTAL_HARD_80 (E): expected number of hard band back- 
ground counts at the source position from all EPIC cameras within 
the 80 per cent EEF ellipse. 

EXP_TOTALJIARD_80 (E): hard band exposure time in seconds 
at the source position from all EPIC cameras. This parameter is the 
average exposure time within the 80 per cent EEF elUpse. 
CNT_PN_HARD_80 (J): hard band counts at the source position on 
the PN camera within the 80 per cent EEF ellipse. 
BKG_PN_HARD_80 (E): expected number of hard band back- 
ground counts at the source position on the PN camera within the 
80 per cent EEF ellipse. 

EXP_PN_HARD_80 (E): PN camera hard band exposure time in 
seconds at the source position. This parameter is the average expo- 
sure time within the 80 per cent EEF ellipse. 
CNTJMI JHARD 80 (J): hard band counts at the source position on 
the MOSl camera within the 80 per cent EEF ellipse. 
BKGjyil_HARD_80 (E): expected number of hard band back- 
ground counts at the source position on the MOSl camera within 
the 80 per cent EEF ellipse. 

EXP_M1JIARD_80 (E): MOSl camera hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
CNTJVI2_HARD_80 (J): hard band covmts at the source position on 
the MOS2 camera within the 80 per cent EEF ellipse. 
BKGJVI2_HARD_80 (E): expected number of hard band back- 
ground counts at the source position on the MOS2 camera within 
the 80 per cent EEF ellipse. 

EXP_M2_HARD_80 (E): MOS2 camera hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
CNT_TOTAL_VHRD_80 (J): very hard band counts at the source 
position from all EPIC cameras estimated within the 80 per cent 
EEF ellipse. 
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BKG_TOTAL_VHRD_80 (E): expected number of very hard band 
background counts at the source position from all EPIC cameras 
within the 80 per cent EEF ellipse. 

EXP_TOTAL_VHRD_80 (E): very hard band exposure time in sec- 
onds at the source position from all EPIC cameras. This parameter 
is the average exposure time within the 80 per cent EEF ellipse. 
CNT_PN_VHRD_80 (J): very hard band counts at the source posi- 
tion on the PN camera within the 80 per cent EEF ellipse. 
BKG_PN_VHRD_80 (E): expected number of very hard band back- 
groimd counts at the source position on the PN camera within the 
80 per cent EEF ellipse. 

EXP_PN_VHRD_80 (E): PN camera very hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF elUpse. 
CNTJVI1_VHRD_80 (J): very hard band counts at the source posi- 
tion on the MOSl camera within the 80 per cent EEF ellipse. 
BKG_M1 -VHRD_80 (E): expected number of very hard band back- 
ground counts at the source position on the MOSl camera within 
the 80 per cent EEF ellipse. 

EXPJVI1_VHRD_80 (E): MOSl camera very hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF elUpse. 
CNT_M2_VHRD_80 (J): very hard band counts at the source posi- 
tion on the MOS2 camera within the 80 per cent EEF ellipse. 
BKG JV12_VHRD_80 (E): expected number of very hard band back- 
ground counts at the source position on the MOS2 camera within 
the 80 per cent EEF ellipse. 

EXP_M2_VHRD_80 (E): M0S2 camera very hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
CNT_TOTAL_UHRD_80 (J): ultra hard band counts at the source 
position from all EPIC cameras estimated within the 80 per cent 
EEF ellipse. 

BKG_TOTAL_UHRD_80 (E): expected number of ultra hard band 
background counts at the source position from all EPIC cameras 
within the 80 per cent EEF ellipse. 

EXP_TOTAL_UHRD_80 (E): ultra hard band exposure time in sec- 
onds at the source position from all EPIC cameras. This parameter 
is the average exposure time within the 80 per cent EEF ellipse. 
CNT_PN_UHRD_80 (J): ultra hard band counts at the source posi- 
tion on the PN camera within the 80 per cent EEF ellipse. 
BKG_PN_UHRD_80 (E): expected number of ultra hard band back- 
ground counts at the source position on the PN camera within the 
80 per cent EEF ellipse. 

EXP_PN_UHRD_80 (E): PN camera ultra hard band exposure time 
in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
CNTJVI1_UHRD_80 (J): ultra hard band counts at the source posi- 
tion on the MOSl camera within the 80 per cent EEF ellipse. 
BKGJV11_UHRD_80 (E): expected number of ultra hard band 
background counts at the source position on the MOSl camera 
within the 80 per cent EEF ellipse. 

EXP_M1_UHRD_80 (E): MOSl camera ultra hard band exposure 
time in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
CNTJVI2_UHRD_80 (J): ultra hard band counts at the source posi- 
tion on the M0S2 camera within the 80 per cent EEF ellipse. 
BKG_M2_UHRD_80 (E): expected number of ultra hard band 
background counts at the source position on the MOS2 camera 
within the 80 per cent EEF ellipse. 

EXPJVI2_UHRD_80 (E): M0S2 camera ultra hard band exposure 



time in seconds at the source position. This parameter is the average 
exposure time within the 80 per cent EEF ellipse. 
ECFJPNJFULL (E): PN full band counts to flux conversion factor 
in units of 10^^counts/(crg s^^ cm~^). 

ECF_M 1 _FULL (E) : MOS 1 full band counts to flux conversion fac- 
tor in units of 10^^counts/(ergs~^ cm~^). 
ECF_M2_FULL (E): MOS2 full band counts to flux conversion fac- 
tor in units of 10^^counts/(ergs~^ cm~^). 
ECF_PN_SOFT (E): PN soft band counts to flux conversion factor 
in units of lO^^counts/ (erg s^^ crn^^). 

ECF_Ml_SOFT (E): MOSl soft band counts to flux conversion fac- 
tor in units of 10^^counts/(ergs~^ cm~^). 
ECF_M2_SOFT (E): MOS2 soft band counts to flux conversion fac- 
tor in units of 10^^counts/(crgs~^ cm~^). 
ECF_PN_HARD (E): PN hard band counts to flux conversion factor 
in units of lO^'^ counts/ (ergs~^ cm~^). 

ECF_M1_HARD (E): MOSl hard band counts to flux conversion 

factor in units of lO^^counts/ (erg s^^ cm^^). 
ECF_M2JfARD (E): MOS2 hard band counts to flux conversion 
factor in units of 10^ ^counts/ (erg cm~'^). 
ECF_PN_VHRD (E): PN very hard band counts to flux conversion 
factor in units of 10^'^counts/(erg s^^ cm~^). 
ECF_M1_VHRD (E): MOSl very hard band counts to flux conver- 
sion factor in units of 1 ^ ^ count s/(ergs~^cm~^). 
ECFJVI2_VHRD (E): MOS2 very hard band counts to flux conver- 
sion factor in units of 10^ ^counts/ (erg cm~^). 
ECF_PN_UHRD (E): PN ultra hard band counts to flux conversion 
factor in units of lO^^counts/ (erg s^^ crn^^). 
ECFjy[l_UHRD MOSl ultra hard band counts to flux conversion 
factor in units of 10^^counts/(erg cm~^). 
ECF_M2_UHRD (E): MOS2 ultra hard band counts to flux conver- 
sion factor in units of 10^^counts/(orgs^^ cm~^). 
HR_PN (E): PN hardness ratio. A value of -9.99 for HR_PN, 
HR JN-UPPER and HR JNXOWER (for the definition of the lat- 
ter two parameters see below) means that the HR_PN could not be 
estimated, e.g. the source lies outside the PN FOV. 
HR_PN_UPPER (E): 68 per cent confidence level upper limit in 
HR_PN. 

HR_PNXOWER (E): 68 per cent confidence level lower limit in 
HR_PN. 

HRJVIl (E): MOSl hardness ratio. A value of -9.99 for HR_M1, 
HRJVIl .UPPER and HRJVIl iOWER means that the HRJVIl 
could not be estimated, e.g. the source lies outside the MOSl FOV. 
HR^I -UPPER (E): 68 per cent confidence level upper limit in 
HRJVIl. 

HRJVIl .LOWER (E): 68 per cent confidence level lower limit in 
HRJVIl. 

HRJVI2 (E): M0S2 hardness ratio. A value of -9.99 for HRJVI2, 
HRjyi2_UPPER and HR_M2_LOWER means that the HRjyi2 
could not be estimated, e.g. the source lies outside the MOS2 FOV. 
HRjyi2_UPPER (E): 68 per cent confidence level upper limit in 
HRJVI2. 

HRJM2X0WER (E): 68 per cent confidence level lower limit in 
HRJVI2. 

FLUX_FULL (E): 0.5-lOkeV flux in ergs^^ cm"^ In the case of 
upper limit in the flux this parameter is set to zero. 
FLUX_FULLXOWER (E): 0.5-10 keV flux lower limit uncertainty 
(68 per cent). In the case of upper limit in the flux this parameter is 

set to zero. 

FLUX_FULL_UPPER (E): 0.5-10 keV flux upper limit (68 per 
cent) 

FLUX_SOFT (E): 0.5-2keV flux in erg s"^ cm'^. 
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FLUX_SOFT_LOWER (E): 0.5-2 keV flux lower limit (68 per cent) 
FLUX_SOFT_UPPER (E): 0.5-2 keV flux upper limit (68 per cent) 
FLUX_HARD (E): 2-lOkeV flux in ergs'^ cm'^ 
FLUX_HARDXOWER (E): 2-10 keV flux lower limit (68 per 
cent) 

FLUX_HARD .UPPER (E): 2-lOkeV flux upper limit (68 per cent) 
FLUX.VHRD (E): 5-lOkeV flux in ergs'^ cm^^ 
FLUX.VHRDXOWER (E): 5-10 keV flux lower limit (68 per 
cent) 

FLUX_VHRD_UPPER (E): 5-lOkeV flux upper limit (68 per cent) 
FLUX_UHRD (E): 7.5-12keV flux in ergs"^ cm'^ 
FLUX.UHRDXOWER (E): 7.5-12 keV flux lower limit (68 per 
cent) 

FLUX_UHRD_UPPER (E): 7.5-12keV flux upper limit (68 per 
cent) 

LG JSIH.GAL (E): Base 10 logarithm of the Galactic hydrogen col- 
umn density in the direction of the source in units of cm^'^ esti- 
mated from the HI map of Kal berla et al.| ( [2005] l. 
FILTERJN (lOA): XMM's PN filter. 
FILTERJVIOSl (lOA): XMM's MOSl filter. 
FILTERJVIOSl (lOA): XMM's M0S2 filter. 



B3 X-ray optical identification parameters 

SDSSJD (20A): SDSS source identification number. 

LR_SOFT (E): Likelihood of the optical counterpart for X-ray 

sources in the soft-band selected sample. 

LRJfARD (E): Likelihood of the optical counterpart for X-ray 
sources in the hard-band selected sample. 

LR_VHRD (E): Likelihood of the optical counterpart for X-ray 
sources in the very hard band selected sample. 
LR_UHRD (E): Likelihood of the optical counterpart for X-ray 
sources in the ultra hard band selected sample, sample. 
LR_FULL (E): Likelihood of the optical counterpart for X-ray 
sources in the full band selected sample. 

LR (E): Likelihood of the optical counterpart. This parameter is 
estimated for all X-ray sources independent of the detection band. 
REL_SOFT (E): Reliability of the optical counterpart for X-ray 
sources in the soft band selected sample. 

REL_HARD (E): Reliability of the optical counterpart for X-ray 
sources in the hard band selected sample. 

REL_VHRD (E): Reliability of the optical counterpart for X-ray 

sources in the very hard band selected sample. 

REL_UHRD (E): Reliability of the optical counterpart for X-ray 

sources in the ultra hard -band selected sample. 

REL_FULL (E): Reliability of the optical counterpart for X-ray 

sources in the full band selected sample. 

REL (E): Reliability of the optical counterpart. This parameter is 
estimated for all X-ray sources independent of the detection band. 
DR_ARCSEC (E): angular separation in arcsec between the X-ray 
source and the optical counterpart. 

DR_RA_ARCSEC (E): RA offset in arcsec between the X-ray 
source and the optical counterpart. 

DRJ3EC^RCSEC (E): DEC offset in arcsec between the X-ray 
source and the optical counterpart 



B4 SDSS photometry parameters 

petro_u (E): SDSS u-band petrosian magnitude of the optical coun- 
terpart. 



petro_g (E): SDSS g-band petrosian magnitude of the optical coun- 
terpart. 

petroj (E): SDSS r-band petrosian magnitude of the optical coun- 
terpart. 

petro J (E): SDSS i-band petrosian magnitude of the optical coun- 
terpart. 

petro_z (E): SDSS z-band petrosian magnitude of the optical coun- 
terpart. 

petro_err_u (E): SDSS u-band petrosian magnitude eiTor of the op- 
tical counterpart. 

petro_err_g (E): SDSS p-band petrosian magnitude error of the op- 
tical counterpart. 

petro_err_r (E): SDSS r-band petrosian magnitude error of the op- 
tical counterpart. 

petro_errJ (E): SDSS i-band petrosian magnitude error of the opti- 
cal counterpart. 

petro_err_z (E): SDSS z-band petrosian magnitude error of the op- 
tical counterpart. 

modeLu (E): SDSS it-band model magnitude of the optical coun- 
terpart. 

modeLg (E): SDSS g-band model magnitude of the optical coun- 
terpart. 

model_r (E): SDSS r-band model magnitude of the optical coun- 
terpart. 

modelJ (E): SDSS i-band model magnitude of the optical counter- 
part. 

modeLz (E): SDSS 2:-band model magnitude of the optical coun- 
terpart. 

model_err_u (E): SDSS it-band model magnitude error of the opti- 
cal counterpart. 

model_err_g (E): SDSS g-band model magnitude error of the opti- 
cal counterpart. 

model_err_r (E): SDSS r-band model magnitude error of the optical 
counterpart. 

modeLerr J (E): SDSS i-band model magnitude error of the optical 
counterpart. 

model_err_z (E): SDSS z-band model magnitude error of the opti- 
cal counterpart. 

psf_u (E): SDSS tt-band psf magnitude of the optical counterpart 
psf_g (E): SDSS g-band psf magnitude of the optical counterpart 
psf_r (E): SDSS r-band psf magnitude of the optical counterpart 
psf_i (E): SDSS i-band psf magnitude of the optical counterpart 
psf_z (E): SDSS z-band psf magnitude of the optical counterpart. 
psf_err_u (E): SDSS it-band psf magnitude error of the optical 
counterpart. 

psf_err_g (E): SDSS it-band psf magnitude error of the optical 
counterpart. 

psf_err_r (E): SDSS it-band psf magnitude error of the optical coun- 
terpart. 

psf_errJ (E): SDSS it-band psf magnitude error of the optical coun- 
terpart. 

psf_err_z (E): SDSS it-band psf magnitude error of the optical coun- 
terpart. 

ext_u (E): Galactic extinction in magnitudes in the SDSS it-band. 
ext_g (E): Galactic extinction in magnitudes in the SDSS g-band. 
ext_r (E): Galactic extinction in magnitudes in the SDSS r-band. 
extJ (E): Galactic extinction in magnitudes in the SDSS i-band. 
ext_z (E): Galactic extinction in magnitudes in the SDSS 2;-band. 
p_type (J): SDSS photometric type, 3 for optically extended source, 
6 for stellar object. 

GALXAT (D): Galactic latitude of the counterpart. 
GALXONG (D): Galactic longitude of the counterpart. 
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B5 SDSS photometric and spectroscopic redshift parameters 

SDSS_SPEC JD (20A): SDSS spectroscopic identification number, 
if available. 

z (D): spectroscopic redshift 
ztype(D): spectroscopic redshift type 

z_photl (D): Neural Netwotk photometric redshift estimated by the 
CC2 method described by |Oyaizu et al.H2008^ . 
z_err_l (D): photometric redshift error for the CC2 method of |Oy-| 
|aizu et al.H2508| l. 

z_phot2 (D): Neural Netwotk photometric redshift estimated by the 

Dl method described by |Oyaizu et al.|j2008) . 

z_err_2 (D): photometric redshift error for the D 1 method of ]Oyaizu| 

[etallpgog) . 



and upper redshift limits of the sample and z faint, Zbright the red- 
shifts at which the source will become fainter than R = 24.1 mag 
and brighter than R = 18.5 mag, the optical magnitude limits for 
optical spectroscopy. Q{Lx, z) is the solid angle of the X-ray sur- 
vey available to a source with luminosity Lx and column density 
Nh at redshift z (corresponding to a flux fx in the X-ray sensi- 
tivity curve). The estimation of the X-ray flux from Lx takes into 
account the intrinsic source Nh by adopting a power-law spec- 
tral energy distribution with F = 1.9 and photoelectric absorption 
cross sections as described by |Morrison & McCammonl ( |1983^ for 
solar metallicity. 



APPENDIX C: SIMULATIONS: THE CMD OF X-RAY 
SOURCES IN THE AEGIS AS WOULD APPEAR AT LOW 
REDSHIFT 

Differential selection effects in the comparison of the CMD of X- 
ray AGN in the AEGIS sample and the low redshift subset of the 
XMM/SDSS survey are accounted for by performing simulations 
to explore how the rest-frame colours of the AEGIS X-ray sources 
at z « 0.8 would appear in our low redshift (0.03 < z < 0.2) 
XMM/SDSS survey. We choose to transform the high redshift sam- 
ple to low redshifts. This is because the Chandra observations of 
the AEGIS field select AGN at rest-frame energies 1-14 keV at 
z ~ 0.8. That sample is therefore less affected by obscuration com- 
pared to the XMM/SDSS survey, which probes lower rest frame 
energies (~ 2 — 8keV) at z < 0.2. The simulations proceed as 
follows. AEGIS X-ray sources are randomly selected by weight- 
ing each one by 1/Vmax, the inverse of the maximum comoving 
volume within which it can be detected. Each source is placed at 
a random redshift in the range 0.03 — 0.2 and its SDSS r-band 
magnitude is estimated using the KCORRECT routines. The source 
is retained in the sample if it is brighter than the magnitude limit 
r = 17.77 mag of the XMM/SDSS low redshift survey. The 2- 
lOkeV X-ray flux, fx{'2 ~ lOkeV), of the source is also deter- 
mined from the X-ray luminosity, Ljf(0.5 — 10 keV), and hydro- 
gen column density, Nh- The sensitivity curve of the XMM/SDSS 
survey is an estimate of the probability of detecting a source with 
given flux within the surveyed area. A random number is produced 
between and the maximum value of the XMM/SDSS survey 2- 
lOkeV sensitivity curve. If the random number is lower than the 
value of the sensitivity curve at flux fx (2 — 10 keV) the source is 
retained in the sample. A total of 10,000 realisations are performed. 

The estimation of the maximum volume takes into account the 
optical magnitude limits for spectroscopy in the AEGIS, 18.5 < 
R < 24.1 mag, the varying X-ray sensitivity in that field and the 
spectroscopic sampling rate. For source i at redshift z, with intrin- 
sic X-ray luminosity Lx, obscuring hydrogen column density Nh 
and absolute optical magnitude Mo-i the maximum volume is 

Vmax,i{Lx,NH,M,z) = — / n(L X , z) Wi — dz dL , (CI) 

Ho dz 

where dV/dz is the volume element per redshift interval dz and 
Wi is a weight that depends on the observed B — R, R — I colours 
and the ii-band magnitude of the X-ray sources and corrects for the 
spectroscopic sampling rate of the X-ray population. The weights 
are estimated using the "minimal" model described by |Willmer| 
|et al.|p006| . The integration limits are zl — max{zhright, zl) and 
z2 = min{zfaint, zu), where we have defined zl, zu the lower 



